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Abstract 

Introduction: The analysis of polyacrylamide gels is currently carried out manually or 
automatically. In the automatic method, there are limitations related to the 
acceptable degree of distortion of lane and band continuity. The available software 
cannot deal satisfactorily with this type of situations. Therefore, the paper presents an 
original image analysis method devoid of the aforementioned drawbacks. 

Material: This paper examines polyacrylamide gel images from Li-Cor DNA 
Sequencer 4300S resulting from the use of the electrophoretic separation of DNA 
fragments. The acquired images have a resolution dependent on the length of the 
analysed DNA fragments and typically it is A/IgxA/g=3806x1027 pixels. The images are 
saved in TIFF format with a grayscale resolution of 16 bits/pixel. The presented 
image analysis method was performed on gel images resulting from the analysis of 
DNA methylome profiling in plants exposed to drought stress, carried out with the 
MSAP (Methylation Sensitive Amplification Polymorphism) technique. 

Results: The results of DNA polymorphism analysis were obtained in less than one 
second for the Intel Core™ 2 Quad CPU Q9300@2.5GHz, 8GB RAM. In comparison 
with other known methods, specificity was 0.95, sensitivity = 0.94 and AUG (Area 
Under Curve) = 0.98. 

Conclusions: It is possible to carry out this method of DNA polymorphism analysis 
on distorted images of polyacrylamide gels. The method is fully automatic and does 
not require any operator intervention. Compared with other methods, it produces 
the best results and the resulting image is easy to interpret. The presented method 
of measurement is used in the practical analysis of polyacrylamide gels in the 
Department of Genetics at the University of Silesia in Katowice, Poland. 

Keywords: Gel, Image processing, Polyacrylamide, Segmentation, 2D 



Introduction 

Modern DNA analysis is used in many areas of life sciences, from biology [1] to foren- 
sic medicine or microwave analysis [2,3]. For many such cases, the analysis of DNA is 
associated with electrophoresis carried out on polyacrylamide gels, an universal analyt- 
ical technique used to separate DNA fragments by size. The advantages of using poly- 
acrylamide gels are low cost of staining separated DNA fractions and also easily 
interpretable analysis results. The obtained results are compared manually or semi- 
automatically. The manual method involves a manual selection of lanes and bands that 
are in the analysed area. Most often this occurs by selecting interesting bands on the 
printed analysis result (Figure 1) or by placing the lines along the lane on the computer 

O© 2013 Koprowski et al.; licensee BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative 
BIoIVIGCI CGntrsI commons Attribution License (http://creativecommons.Org/licenses/by/2.0), which permits unrestricted use, distribution, and 
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Figure 1 Manual analysis of polyacrylamide gels in the distorted polyacrylamide gel image. The 

operator manually selects interesting bands. Their location, number, and correlation with other bands 
detected manually are recorded in the margin. Because of characteristic distortion, common software 
cannot deal with the arrangement of individual lanes in these images. 



screen. This analysis involves finding the location of the subsequent bands along the 
lane that was marked by the operator. In this case, errors occur: 

• due to different print quality depending on the type of printer (contrast, type of 
paper used etc.), 

• related to the participation of human factors, i.e.: the impact of experience, fatigue, 
sensitivity to illusions related to the impact of the expected result on the course of the 
analysis and an individual threshold of distinguishing bands from the background. 

A major difficulty is also the amount of time devoted to the same analysis and the 
lack of reproducibility of measurements. 

Known methods and software for automatic analysis have disadvantages mainly re- 
lated to the analysis of gel images in which individual bands are not located on a 
straight lane (Figure 1). There are also problems with the proper separation of lanes 
and detection of bands which are arranged close to each other. The analysis of the 
DNA fragments, observed as bands on a gel image, can thus be divided into two parts: 
the separation of lanes and the separation of bands in each lane. The result of detection 
of band positions is most often the matrix Lj^^^^ containing the value "1" in the places 
where a band occurs and "0" in the other places. The number of rows of the matrix 
Ldna corresponds to the number of positions of all the bands, and the number of col- 
umns corresponds to the number of gel lanes [4,5]. Since the matrix L^j^a is, by defin- 
ition, a binary matrix, further analysis and comparison of results for subsequent lanes 
is easy. Therefore, a key issue is appropriate separation of lanes and bands for each lane 
related to image analysis and processing. 

The first works on the analysis and processing of polyacrylamide gel images obtained 
from electrophoresis are from the 80's, for example, the works of L. Lipkln [6] or 
Stanley et al. [7]. These relate to the basic methods of analysis of image brightness for 
each lane. The authors of [8] does not include any information on how to separate indi- 
vidual lanes. The authors assume that they are arranged perfectly parallel. Similar as- 
sumptions are in [9,10]. The authors of [11] from 2001 present the analysis of 
individual bands using information about the brightness gradient. Bands are defined 
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depending on the distance between the changes of the gradient sign. This method is useless 
when two neighbouring bands are connected or there is uneven brightness on the whole 
gel. In other works, different methods of image analysis and processing are used, e.g.: active 
contour [12], the Gaussian distribution [11], fuzzy c-means algorithm [13] or statistical ana- 
lysis [14]. Another group of works is devoted to the development of these methods. For ex- 
ample, the works of J. Pizzonia [15] and L. Carol [16], GILE software (Gel-Image-Extractor) 
[17] or [18-23]. In [18], gels in large scale were analysed, [19] used the method of least 
squares, and [20] shows a method of using morphological operations (erosion) in the ana- 
lysis of ROI (Region Of Interest) of gels. The aforementioned GILE software [17] is not the 
only available software. There are other applications for automatic or semi-automatic ana- 
lyses of 2D gels, such as GelQuant [24], GelAnalyzer [25], Gel-Pro Analyzer [26], Decodon 
[27], BioNumerics 2D [28], Delta2D [29], ImageMaster 2D, Melanie [30], PDQuest [31], 
Progenesis Samespots [32] or REDFIN [33] and many others. A wide range of available pro- 
grams for gel image analysis enables to obtain satisfactory results in the case of simple gels 
with individual lanes arranged in parallel. If there are artefacts, connected lanes or bands, 
this group of software [19,24-33] allows for their manual editing. In these cases, the method 
is semi-automatic or fully manual. Therefore, more sophisticated methods of image analysis 
must be used or the analysis algorithm must be profiled precisely to the specified problem 
(a given type of gels). One such method proposed by the authors is described below. It is 
characterized by a new approach to the analysis of polyacrylamide gels which provides: fully 
automatic measurement of the band position, automatic determination of the lane position 
in cases of their local distortion, results in the form of a matrix of band occurrences (for all 
lanes). A special feature that distinguishes the approach presented below from other well- 
known methods, is the correct algorithm operation in cases of changes in lane thickness. 

Material 

This paper examines polyacrylamide gel images from Li-Cor DNA Sequencer 4300S 
resulting from the use of the electrophoretic separation of DNA fragments. The acquired 
images have a resolution dependent on the length of the analysed DNA fragments and typ- 
ically it is MgxA/g=3806x1027 pixels. The images are saved in TIFF format with a grayscale 
resolution of 16 bits/pixel. The images of banding patterns of DNA amplification products 
were obtained after digestion with two enzymes, namely Hpall and Mspl, used in the DNA 
methylome profiling method, MSAP (Methylation Sensitive Amplification Polymorphism). 

The analysis was performed on DNA isolated from plants exposed to drought stress 
at four time points ti^ t2, h and t4 and from control plants at one time point (^5), which 
gave a total of 5 points (the first two time points ti and ^2 are shown in Table 1). At 



Table 1 The arrangement of biological and technical replicates on the gel - the first two 
time points 



Time point 










t2 




biological 
repeat 


6/0/7 


biol2 




biolj 


biol2 


biols 


Enzyme 


Hpall N\sp\ 


HpaW Msp\ 


HpaW Msp\ 


HpaW Msp\ 


HpaW Msp\ 


HpaW Msp\ 


technical 
repeat 


\ 1 \ 1 


12 12 


12 12 


12 12 


12 12 


12 12 


trial number 


\ 1 ^ A 


5 6 7 8 


9 10 11 12 


13 14 15 16 


17 18 19 20 


21 22 23 24 
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each time point, DNA of 3 plants was analysed {bioli, biol2 and biolg - Table 1). The 
reaction was carried out in two technical replicates for each enzyme, for a total of four 
trials for each of the three biological replicates. The trials are arranged on gels vertically 
in successive lanes according to Table 1. Banding patterns were analysed by assessing 
the presence or absence of a band for a given track by transforming the gel image into 
a matrix consisting of "0" or "1", where "0" means no band and "1" means its presence. 
Further analysis involves designation of the dominant banding pattern for all replicates, 
both the technical as well as biological ones. What is subject to assessment are the dif- 
ferences in banding patterns between the points ti^ t2, % t4 and ^5. A total of 20 gel im- 
ages were analysed in each of 60 tracks with different levels of distortion. 

Implementation of the new method 

The new method, proposed by the authors, is based on the analysis of gel images which 
is carried out in two separate stages: analysis of lanes and analysis of bands. In both 
stages there are similar problems with the detection of objects and the removal of their 
redundancy (interference). Both the first analysis and the latter one require pre- 
filtration of the image and normalization described below. 

Preprocessing 

In the first stage, the image with a resolution of MgxA^g=3806x1027 pixels is 
subjected to filtration with a median filter whose mask size is 3x3 pixels (the result is 
the image [34-36]. The median filter mask size was chosen based on the image 
resolution and the size of possible artefacts present in the image (minor defects, CCD 
errors and noise of the electronics). In a further step, the image Lp undergoes 
normalization operation from the range of brightness levels corresponding to 16 bits 
(2^^) to the floating range 0—1. The image Lq thus obtained is subjected to further pro- 
cessing steps. 

Processing - the analysis of lanes 

The input image Lq, after filtration and normalization, was subjected to the analysis of lanes. 
It involves the operation of closure with a structural element SE sized M5£xA/5£=40xl pixels 
[37-41]. The size of the structural element SE was chosen in such a way as to highlight the 
changes in brightness between individual lanes in the gel image. When decreasing the reso- 
lution MsEi the contrast between individual lanes decreases. An increase in the resolution 
MsE causes the loss of information about the angle changes of the lane. When increasing 
NsE^ on the other hand, the contrast between adjacent lanes decreases. 

The resulting image Lq is further analysed in the subsequent rows. For each row in 
the range me[0,MG]> level differences of brightness ycM are analysed in relation to the 
filtered function yj{n) of ycM, The value of ycM is brightness for the selected m i.e. 
ycM =Lc{m,n) for m=const, while the value of yj(n) results from the filtration of ycW 
with an averaging filter sized 1x10. The averaging filter size is chosen once and is equal 
to the typical lane width, which in this case is 10 pixels. The resulting differences are 
shown in Figures 2 and 3 where differences below zero (black bands in the white back- 
ground) form the white pixels and the other ones form black pixels. The resulting 
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Figure 2 Gel input image and subsequent stages of tlie lane analysis. From the left: the input 
image Lq, the image Lq after filtration and normalization, and below, the image Lc after opening and the 
image Lq which is a binary image with automatically marked lanes. Due to the high resolution of the 
presented images, on their right side there is a chart that shows changes in brightness of a fragment. 



image is the final stage of the analysis of lanes. For further analysis, it will be used in 
conjunction with the source image Lq, 

Processing - the analysis of bands 

The analysis of band positions requires detection of each lane individually. This means 
that it should be possible to separate each lane in the image L^. Unfortunately, in prac- 
tice, due to the existence of artefacts, a complete separation of each lane is impossible. 
The lanes visible in the image Lb are often linked together, interrupted or terminated in 
the wrong place - Figure 4 [42-46]. Due to such situations, the analysis of individual 
bands and their location for individual lanes must be performed in smaller ROIs. The 
ROI size must be no less than the width of a single lane and not greater than an aver- 
age distance between the artefacts. In practice, it appears that the best results are 
obtained for ROI sizes that are 10 to 20 times the width of the lane [47,48]. In the 
analysed case, it is MroixNroi=200x200 pixels. The image is divided into Mg/Mrqi 
in rows and Ng/Nj^qi in columns. In total, for the resolution of the image Lq equal to 
MgxA/^c?=3806x1027 pixels, there are, after rounding, 95 ROIs for the analysis. For each 
ROI, the lanes were labelled, which gave the image Ljj^f), Then, band detection is car- 
ried out for each lane in an analogous manner to lane detection [49-51]. The difference 
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Figure 3 Examples of functions of chianges in brighitness for thie row of thie image matrix Lq. 

There are shown differences between yc(n) and y^n) which provide valuable information about local 
changes in brightness. Depending on the value of these differences, decisions are made about the 
detection of lanes whose width must fall within the adopted range. Narrower objects are considered as 
interference and the wider are split into smaller ones. 
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Figure 4 Image showing various artefacts that occur when analysing lanes. The images show the 
following cases: A) - correct, the lanes are continuous and have a fixed width, B) - incorrect, the lanes 
have uneven width, C), D) - incorrect, the lanes are broken and their proper identification and classification, 
without further analysis, is difficult, E) the lanes are not arranged in a straight line, F) the lanes do not have 
equal width. 
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Figure 5 Changes in the brightness of a selected lane in the image Lq. The difference between the 
brightness yp(m) calculated along the lane and its averaged result yG(/T?) is vital. Depending on the result of 
yp{rr])-yc{m), the position of markers indicating subsequent bands is determined. The chart shows the range 
PTik-mp which is the width of a sample band. 



between the brightness yp{m), calculated along the lane, and its averaged result yci^) are 
analysed. The averaging area of the input waveform yp{m) is chosen once, individually for 
each device. This area is dependent on the image resolution and typical (average) lane 
width. For the case described, it is the area of 10 pixels. Depending on the result of ypG 
{m)-yp{m)'yG{m), the location of markers which indicate the subsequent bands is deter- 
mined. Two features are considered here: the maximum difference ypG^^^^iypci^)) for 
mE[mpnij^ where nipm^k are the beginning and end of the /-th band and the width of the 
band Am^= m/^- nip. The band feature based on the pair ypQ and Am^ can be compared to 
the area under the curve yp^ ' Figure 5. The parameter Am^ which enables the separation 
of combined bands, will be used in further analysis. In practice, it is most convenient to 
adopt the acceptable range of variation of Am^ covering the range of 50-150% of a typical 
band width. Below this range (<50%), a detected object is considered to be interference, 
whereas above this range (>150%) a detected object is considered as a combination of two 
bands. Another considered parameter of thresholding ypQ was chosen on the basis of 
the analysis of sensitivity SPC and specificity TPR, The values of sensitivity and specificity 
were determined by comparing the performed automatic analysis with the manual ana- 
lysis carried out by an expert for 20 images containing 60 lanes each. Depending on the 
detection or omission of a band in any of the trials, the results were determined as false 
negative FN, false positive FP, true negative TN and true positive TP, The obtained results 
of FN, FP, TN, TP for the optimal value of Pr=9% are shown in Table 2 (5PC=0.95, 
rP7?=0.94). 

The last stage of the analysis is the conversion of individual band positions to the matrix 
Ldna in which columns correspond to subsequent lanes and rows to the location of 



Koprowski et al. BioMedical Engineering Online 2013, 12:68 
http://www.bionnedical-engineering-online.conn/content/1 2/1/68 



Page 8 of 14 



Table 2 Results of FP, FN, TP, TN, SPC and TPR obtained for the threshold Pr changed in 
the range from 0 to 30% 





FP 


FN 


TP 


TN 


SPC 


TPR 


0 


61698 


67 


6316 


1022 


0.01 


0.98 


1 


29680 


76 


6307 


32921 


0.52 


0.98 


2 


19360 


81 


6302 


43221 


0.69 


0.98 


3 


15197 


81 


6302 


47378 


0.75 


0.98 


4 


11452 


81 


6302 


51117 


0.81 


0.98 


5 


8746 


88 


6295 


53821 


0.86 


0.98 


6 


6762 


111 


6272 


55801 


0.89 


0.98 


7 


5006 


111 


6272 


57550 


0.91 


0.98 


8 


3821 


192 


6191 


58734 


0.93 


0.96 


9 


2603 


321 


6062 


59952 


0.95 


0.94 


10 


1914 


471 


5912 


60639 


0.96 


0.92 


11 


1144 


615 


5768 


61409 


0.98 


0.90 


12 


759 


690 


5693 


61792 


0.98 


0.89 


13 


502 


766 


5617 


62049 


0.99 


0.87 


14 


303 


887 


5496 


62248 


0.99 


0.86 


15 


0 


1010 


5373 


62548 


1 


0.84 


16 


0 


1193 


5190 


62548 


1 


0.81 


17 


0 


1786 


4597 


62548 


1 


0.72 


18 


0 


2206 


4177 


62548 


1 


0.65 


19 


0 


2642 


3741 


62548 


1 


0.58 


20 


0 


3126 


3257 


62548 


1 


0.51 


21 


0 


3300 


3083 


62548 


1 


048 


22 


0 


3968 


2415 


62548 


1 


0.37 


23 


0 


4454 


1929 


62548 




0.30 


24 


0 


4806 


1577 


62548 




0.24 


25 


0 


5090 


1293 


62548 




0.20 


26 


0 


5220 


1163 


62548 




0.18 


27 


0 


5249 


1134 


62548 




0.17 


28 


0 


5307 


1076 


62548 




0.16 


29 


0 


5329 


1054 


62548 




0.16 


30 


0 


5382 


1001 


62548 




0.15 



subsequent bands - Figure 6. This matrix is formed as a result of the conversion of the 
band and lane coordinates {m^s>K) for the adopted /pg and Am\ This matrix is further veri- 
fied in terms of reproducibility between technical replicates (Table 1), biological replicates 
and finally differences in the DNA structure. Then, these differences are easy to mark auto- 
matically (a comparison of adjacent columns with the operation xor). In this case, the 
matrix I/^am has a constant number of columns equal to the number of lanes, whereas the 
number of rows is variable and depends on the threshold value, namely Pr. This matrix has 
a resolution of 5x60 to 200x60 pixels for typical conditions. 

The next section shows the comparison of the quality of the obtained results with 
other methods described in other works. 
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Figure 6 Input image witii marlced and identified bands and the output matrix Ldna for A) 

Pr=0.2, B) Pr=OA, C) Pr=0.6, D) Pr=0.8. Each row of the matrix Ldna is the band position and the column 
is the subsequent lane. The matrix L^na can be directly compared for selected columns. The comparison 
concerns the assessment of conformity of performed technical and biological tests and the differences 
between the structure of the DNA. The white pixels indicate a recognized band and the black ones 
indicate the background. 

^ J 

Comparison with other methods 

The comparison of the quality of the results was carried out on 20 available gel images 
containing 60 lanes each, which gave a total of 1200 lanes. Two image analysis algorithms, 
known from the literature, were implemented; method 1 - [5] and method 2 - [11]: 

Method 1 - proposed by I. Bajla et al in [5] - Figure 7A, This method involves filtra- 
tion with a non-linear two-dimensional filter. Then, lane detection, smoothing and the 
analysis of peaks in the background area of the lane are carried out in the resulting 
image. In the next step, an operator manually corrects false results. Obtained results 
are shown in the form of bands in a gel diagram. This method is not fully automatic. 
The operator must manually correct the falsely detected bands. The number of wrongly 
identified bands varies and is highly dependent on the operator s individual features, 
mainly contrast threshold below which the band is considered as interference. 

Method 2 - proposed by Jiann-Der Lee et al. in [11] - Figure 7B. The analysis of im- 
ages is as follows. After entering the image, there is lane detection followed by band 
detection. Band detection is carried out on the basis of the enhanced fuzzy c-means al- 
gorithm. The authors suggest a nonlinear estimator based on the Gaussian kernel 
transformation. The position of the band, which is treated as an ellipse of varying sizes, 
is recognized in the range proposed by the authors. The analysis errors obtained by 
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A) 



B) 



Nonlinear 2D-filtering 

' Detection of horizontal 
l>and boundaries 

" Appropriate filter 

^ Bandbox generation 

^ Operator eliminating 
false detections 

* Calculate distances 
in the l>and boxes. 



Image acquisition _| 
Enhanced FCM J 
Lane detection ~^ 
Band detection 
DNA sequence 



c) 

d 



Image acquisition 
Median filter 
Normalization 

Morphological closing c 
Normalization in rows w 

w 

Averaging filter ^ 
Binarization < 

Division image - ROI % % 
Band detection ^ ^ 

Conversion to the ^ 

resulting matrix ^ 

Comparing the ^ 
position of the bands | | 

Figure 7 Algorithm blocic diagrams. A) - the algorithm proposed in this article, B), C) - other compared 
algorithms proposed by I. Bajla in [5] and Jiann-Der Lee in [1 1]. Visible differences relate primarily to the 
main analysis of data. In the proposed method, the analysis consists of two steps: lane detection followed 
by band detection. In the other compared methods, the main element of the algorithm is band detection, 
whereas lane identification is neither analyzed nor presented. 



the authors for the database of gel images described in [11] are 6.7% and 12.8% when 
using the N. Otsu thresholding method [52]. 
Method 3 - described in this paper - Figure 7C. 

All 3 methods were implemented according to the descriptions in [5,11] and in ac- 
cordance with the block diagrams shown in Figure 7A,B and C. The results depend on 
three elements: the degree of lane tortuosity, the threshold of band distinction and 
shifts in the position of bands for individual lanes. 

The degree of lane tortuosity influences, to a significant extent, the distinction error of 
the band position in the axis Ox, The consequence of winding lanes or their uneven thick- 
ness is that the error of the correct assignment of a band to a lane for the 1200 analysed 
lanes for methods 1 and 2 is very large and highly dependent on the amplitude of lane tor- 
tuosity (Figure 4E). In method 3, discussed in this paper, the error does not exceed a few 
per cent (the exact comparison is carried out in the next paragraph). The advantage of 
method 3 over methods 1 and 2 results from the lane area analysis. However, the problem 
with method 3 is an appropriate separation of lanes when the shift does not fall within the 
range of the ROI analysis. Although methods 1 and 2 enable manual correction of the 
obtained results, it consumes a lot of time and requires operator intervention in the results. 
Method 3, on the other hand, is fully automatic. 

The threshold of band distinction is highly dependent on the adopted methodology 
of gel image analysis. In method 1, the band width on one-dimensional waveform is 
specified. The changes in the median values in front of and behind the band are 
analysed. Based on this comparison, a new band is recognized on a given lane. Due to 
the constant band width adopted in method 1 Am^=const. This method cannot deal 
with the proper detection of bands which are close to each other, especially in situa- 
tions when the combined total width of the bands is not close to a multiple of their 
width (2*AmO, and these situations often occur in practice [50,51,53]. In method 2, two 
closely situated bands are weU separated even when their combined width is not equal 
to a multiple of their width. 
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In each method, the selection (manual or automatic [52,54]) of the brightness threshold p^, 
that determines the visibility of bands, becomes dominant. As a result, for methods 1, 2 and 
3, the threshold Pr was changed while observing changes in specificity and sensitivity of the 
ROC curves (Receiver Operating Characteristic). The results are shown in Figure 8. The best 
results were obtained for method 3, i.e.: 5PC=0.95, rP/?=0.94. In the case of methods 1 and 
2, low sensitivity and specificity are due to the lack of manual correction of the results. For 
method 1, specificity and sensitivity are as follows: SPC=0.72, TPR=0.5, For method 2, they 
are: SPC=071, TPR=072, Manual correction of the results obtained in methods 1 and 2 im- 
proves the results to the ideal values. Therefore, in the case of manual correction of the re- 
sults, they are always only slightly better than the results obtained from the presented 
automatic method 3 (5PC=0.95, TPR=OM and A/iC=0.98) - Table 3. 

Comparing the results for the analysed lanes of all gels, the following conclusions can 
be drawn: 



- method 1 enables to obtain satisfactory results for little complex analysis in which 
there is no need to analyse lane tortuosity; it is possible to manually correct the 
incorrect results, 

- method 2 enables to obtain good results for complex analysis; it is possible to 
manually correct the incorrect results, 

- method 3 enables to obtain good results fully automatically even in the case of 
winding lanes; it is fully automatic. 



In addition to methods 1 and 2, the described method 3 can be compared with many 
other known methods. These are the ones mentioned in the introduction, or for example. 




1-SPC 

Figure 8 ROC graph. Dependency of sensitivity and specificity clianges for tlie clianges in tlie cut-off 
tliresliold p^for metliods 1, 2 and 3 witliout manual correction of tlie results. Changes in Pr are in the 
range from 0 to 1 for the compared methods 1, 2 and 3. The best results were obtained for method 3 
presented in this article. Methods 1 and 2 have worse results. For manual correction of the results, possible 
in methods 1 and 2, the obtained results are perfect. 
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Table 3 Comparison results of the three methods for the analysis of polyacrylamide gel 
images 



Method 


Author 


SPC 


TPR 


1 


Bajla et al. [5] 


0.72 


0.50 


2 


Jiann-Der Lee et al. [1 1] 


0.71 


0.72 


3 


Koprowski et al. this article 


0.95 


0.94 



the method described in detail by Caridade C. in [55]. The author describes the method 
(GEIAS) of gel image analysis and the correction of their wrong position - rotation. For 12 
images (1082 bands in total) analysed in [55] the error is 9.2%. The images are also rotated 
in the angular range from -10° to 10° at 0.5° increments. The resistance of the proposed al- 
gorithm GEIAS to rotation is then analysed. The method described in [55] does not cover 
the width distortion of individual lanes which often occurs in practice. It is one of the 
major differences in comparison to method 3 proposed in this paper. 

Method 3, in the described cases, enables to obtain the best results, but it does have 
its drawbacks. These include: 

- limited resistance to lane width distortion - especially in situations when they occur 
together with the noise of image acquisition, 

- limited resistance to the decay of individual lanes - for example due to the errors in 
image acquisition or other (biological) factors, 

- the need for single introduction of selected parameters of the algorithm for each new 
gel image acquisition device. 

Elimination of these defects is difficult in practice and will be the subject of the au- 
thors' future works. 

Conclusions 

The proposed methodology for the analysis and processing of polyacrylamide gel im- 
ages enables to perform an automatic and repeatable measurement of the position of 
lanes and bands. This method is superior to the previously presented methods, de- 
scribed in [5] and [11], in cases of gel deformation. In such situations, each track width 
is different and it is difficult to identify to which lane the recognized band belongs. 

Proper identification of bands and lanes in the described method enables to obtain 
the output image Ldj^a- This image {Lona) contains information about the band posi- 
tions in rows and information about the lane positions in columns. The number of rec- 
ognized bands is highly dependent on the threshold Pr which is determined manually 
or automatically. The best results (5'PC=0.95, TPR=034^ and AUC=038) were obtained 
for pr=8%. The results are worse for other threshold values (Figure 6). 

The presented measurement method is used in practical analysis of polyacrylamide 
gels in the Department of Genetics at the University of Silesia in Katowice, Poland. 

Further studies concern the correction of the obtained results shown in Figure 6. In 
the case of differences between repeats shown in the image Lj^j^^, it is necessary to 
move back to the image Lq and analyse the brightness of the area of interest. Depend- 
ing on the comparison, correction must be made in the image I/^ata- However, this 
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method suggests a distinct approach to the analysis of lanes that involves the compari- 
son of adjacent lanes directly in the image Lq, which requires separate studies. 
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